Political Leaning Categorization by Exploring Subjectivities in Political Blogs
نویسندگان
چکیده
This paper addresses a relatively new text categorization problem: classifying a political blog as either ‘liberal’ or ‘conservative’, based on its political leaning. Instead of simply using “Bag of Words” features (BoW) as in previous work, we have explored subjectivity manifested in blogs and used subjectivity information thus found to help build political leaning classifiers. Specifically, our subjectivity based approach is two fold: 1) we identify subjective sentences that contain at least two strong subjective clues based on the General Inquirer dictionary; 2) from subjective sentences identified, we extract opinion expressions and BoW features to build political leaning classifiers. Experiments with a political blog corpus we built show that by using features from subjective sentences can significantly improve the classification performance. In addition, by extracting opinion expressions from subjective sentences, we are able to reveal opinions that are characteristic of a specific political orientation to some extent.
منابع مشابه
Finding Political Blogs and Their Political Leanings
The Blogosphere has more influence on both the general public’s opinions and mainstream media nowadays. As vast political blogs representing grassroots voices go online, it is both interesting and useful to find them and determine their political leanings as either liberal or conservative. In this paper, we address both problems by using front pages of blogs in a corpus we built to create two c...
متن کاملPreliminary Semantic Analysis of Political Blogs
In this paper, we present a series of semantic analyses of words in political blogs in the setting of categorization of two opposite political orientations: liberal vs. conservative. We classify nouns, verbs, adjectives and adverbs into semantic categories by using the General Inquirer dictionary. Then distributions of these categories and correlations among them are examined both within and be...
متن کامل‘The Right to Information’: A Malaysian Political Blog Readers’ Perspective
Political blogs are one of the pivotal alternative communication channels for political news in Malaysia. Many have argued that the mushrooming of political blogs nurtures the effective realization of human rights in the country. The paper studies the ‘Malaysian political blog readers–human rights’ relationship by exploring these questions: Has traditional mainstream media become obsolete with ...
متن کاملShedding (a Thousand Points of) Light on Biased Language
This paper considers the linguistic indicators of bias in political text. We used Amazon Mechanical Turk judgments about sentences from American political blogs, asking annotators to indicate whether a sentence showed bias, and if so, in which political direction and through which word tokens. We also asked annotators questions about their own political views. We conducted a preliminary analysi...
متن کاملBlog Classification with Co-training
In this project we use co-training to classify blogs by political leaning. We classify them into two pre-defined categories: liberal and conservative. We examine the performance of co-training versus normal supervised learning. We also look at using several different features to improve training.
متن کامل